CDS

Accession Number TCMCG036C09114
gbkey CDS
Protein Id PTQ41575.1
Location complement(join(49696..50347,50744..50778,50948..51364,51726..52058,52645..52711,53056..53122,53249..53374,53658..54000,54118..54279,54483..54578,54739..54870,55132..55356,56151..56279,57103..57186,57310..57465,57646..57786))
GeneID Phytozome:Mapoly0033s0004
Organism Marchantia polymorpha
locus_tag MARPO_0033s0004

Protein

Length 1054aa
Molecule type protein
Topology linear
Data_file_division PLN
dblink BioProject:PRJNA53523, BioSample:SAMN00769973
db_source KZ772705.1
Definition hypothetical protein MARPO_0033s0004 [Marchantia polymorpha]
Locus_tag MARPO_0033s0004

EGGNOG-MAPPER Annotation

COG_category K
Description Homeodomain
KEGG_TC -
KEGG_Module -
KEGG_Reaction -
KEGG_rclass -
BRITE -
KEGG_ko -
EC -
KEGG_Pathway -
GOs GO:0003674        [VIEW IN EMBL-EBI]
GO:0003676        [VIEW IN EMBL-EBI]
GO:0003677        [VIEW IN EMBL-EBI]
GO:0003697        [VIEW IN EMBL-EBI]
GO:0005488        [VIEW IN EMBL-EBI]
GO:0005575        [VIEW IN EMBL-EBI]
GO:0005622        [VIEW IN EMBL-EBI]
GO:0005623        [VIEW IN EMBL-EBI]
GO:0005634        [VIEW IN EMBL-EBI]
GO:0005730        [VIEW IN EMBL-EBI]
GO:0006355        [VIEW IN EMBL-EBI]
GO:0008150        [VIEW IN EMBL-EBI]
GO:0009889        [VIEW IN EMBL-EBI]
GO:0009890        [VIEW IN EMBL-EBI]
GO:0009892        [VIEW IN EMBL-EBI]
GO:0010468        [VIEW IN EMBL-EBI]
GO:0010556        [VIEW IN EMBL-EBI]
GO:0010558        [VIEW IN EMBL-EBI]
GO:0010605        [VIEW IN EMBL-EBI]
GO:0010629        [VIEW IN EMBL-EBI]
GO:0019219        [VIEW IN EMBL-EBI]
GO:0019222        [VIEW IN EMBL-EBI]
GO:0031323        [VIEW IN EMBL-EBI]
GO:0031324        [VIEW IN EMBL-EBI]
GO:0031326        [VIEW IN EMBL-EBI]
GO:0031327        [VIEW IN EMBL-EBI]
GO:0031974        [VIEW IN EMBL-EBI]
GO:0031981        [VIEW IN EMBL-EBI]
GO:0043226        [VIEW IN EMBL-EBI]
GO:0043227        [VIEW IN EMBL-EBI]
GO:0043228        [VIEW IN EMBL-EBI]
GO:0043229        [VIEW IN EMBL-EBI]
GO:0043231        [VIEW IN EMBL-EBI]
GO:0043232        [VIEW IN EMBL-EBI]
GO:0043233        [VIEW IN EMBL-EBI]
GO:0044422        [VIEW IN EMBL-EBI]
GO:0044424        [VIEW IN EMBL-EBI]
GO:0044428        [VIEW IN EMBL-EBI]
GO:0044446        [VIEW IN EMBL-EBI]
GO:0044464        [VIEW IN EMBL-EBI]
GO:0045892        [VIEW IN EMBL-EBI]
GO:0045934        [VIEW IN EMBL-EBI]
GO:0048519        [VIEW IN EMBL-EBI]
GO:0048523        [VIEW IN EMBL-EBI]
GO:0050789        [VIEW IN EMBL-EBI]
GO:0050794        [VIEW IN EMBL-EBI]
GO:0051171        [VIEW IN EMBL-EBI]
GO:0051172        [VIEW IN EMBL-EBI]
GO:0051252        [VIEW IN EMBL-EBI]
GO:0051253        [VIEW IN EMBL-EBI]
GO:0060194        [VIEW IN EMBL-EBI]
GO:0060195        [VIEW IN EMBL-EBI]
GO:0060255        [VIEW IN EMBL-EBI]
GO:0065007        [VIEW IN EMBL-EBI]
GO:0070013        [VIEW IN EMBL-EBI]
GO:0080090        [VIEW IN EMBL-EBI]
GO:0097159        [VIEW IN EMBL-EBI]
GO:1901363        [VIEW IN EMBL-EBI]
GO:1902679        [VIEW IN EMBL-EBI]
GO:1903506        [VIEW IN EMBL-EBI]
GO:1903507        [VIEW IN EMBL-EBI]
GO:2000112        [VIEW IN EMBL-EBI]
GO:2000113        [VIEW IN EMBL-EBI]
GO:2001141        [VIEW IN EMBL-EBI]

Sequence

CDS:  
ATGATGGGATCCATGCAACTGGACGTAATTATGGCTGTGGAGGAATTGCACGGTCTGAGCGCCCGGGACATGAGCAAGCTTCTAAAGGACCTCCATAATTTCACGTTTCGATTCAACACGAGTAGAGGCGCTCTTATGCGGGTTGATGTACAGCGACTCGCAACGAGCTTGCCTCTGCATTTGCTTGCAAGTGTAGCATCCACAGGTGCCGAACCACGGTTCCGATATTTACTTCGAGGTGTTCGCCTCTTGCATTCTCTGAGCGACTTGGCTCTGCGGTACCCTAAATTGGAGCAGCTACTACTACACGAGGTGAAGCTAAGGGAGCAGGTCTTGGATTTAGTCATATACATGCTCATCGTCCTGGCCAGTGTAGAGCAGGAAGGCCGAGCTGGCAGCTTTGTGGCTTTGCTGCACGCAGCATTAGTGGCCTGCTGCCTCTATCTTTTGACAGCATTCGTGACGAAAGAATGGCACGATGTGGTGCCTGTTCTCTTGGCACACCCGAAGGTAGATGTATTCATGGACGCAGCATTTGATGTTGTAAGACGAGACATTGGGCTTTTGCAAAGTAAATTGCGCACCTTGCACATAAAGCTGGCAAGTAAGAAAGCAACTCTGCCTGCTGCAGAACGCGTTGGTGAGGCTACTGCACAACAATGCGAAGCATCTCTCCAAGTGCTTCAAAGTTTATGTCAAACGCAGGCATTCAGAGATCAGCTTCTACACCATAAGGAGCTCTGTACAAGGGGTGGAGTGCTGCGCCTGACACTCGCCGTTTTGAAGCTGAACATGCCCAGTTCGTTACAACAATCGAAGTCACTGGTTGCGTTCATTTCCCGTTTGAAATCCAAAGCATTATCAATGCTGCTAAAATTTTGCGAGACTGAGTCGGTGTGCTTTCTGGACGAAGTTGCAGCAGAGCCTCAAAGCATGCAATTGGCTGAACTTGTTGCTTGTGAGGTCCTCGCTTTGGTCAGAAGTGCCTTATTAGAGGAGCCGAGGCGAATCGAAGACGCTGAAACTGAGCAGAATCCTATGGGGTCTTTGCACATAAATGCTATGCGCCTAGCCGATCTTTTCTCGGACGATTCCAACTTTCGGAACTTGGTGATGGATGGCATTGCCCCAGATCTGGGAACAGTTCTGGCAGCACCACCGGGTGTATTCACCTCTCGTTGGTGTGAGGGTGATATAGCCGTTAACTTGGCAAAGGGAGAGATGGATGCAATTTTAGTCTATGACCCCTTTCAGGCAGCAGGTGCAGCAATGATAGCATCTGTAAAAGGTGCTACAGCCCCCGTATCTGGCTCCCCTGGAGACTCCCTTCAACCGTTAGAAGAGATGGAAACAGGATGTGTGCTCTCAATAGAGAGCACTTCACCTGATCTATATGCACGAGAACGCTCAGCACTGCTTGTCAAGCTTTTCGCTAATCTCACCTGCTTCAATCCAGAGGTTTGCTCAGTGGATGAGAAGGATCGCTTTTTGCACGTTTTCGTGGAGTGTCTTGCTTCAGGACCTCTGATGAAATCCAGCAGCTCTCATTTTTTGACTCCAGAGCAGACTGCTGTCCGTATTTGCGAGAATTTGTATGTGTTGCTTGATCATGTGGTGTCATCCTCACATACGGTTAACGATGAAGACTTGTCTCTTGTCAGCCAATTCTCTGACGCACTTCACCATGCAATTTGCCCATCTCCACCCTCTGACGAATCAACCTGTCTGGCTGCCCCAGTCGTGAGTTACCTGCGAGATGCACACGAGAAAAAATTGCTGATGATACATACTCGCCGTCAAGAGCTTCCTCGTTGGCAGAAGATTTATCAAAGTTCAATGGAGCTTCAGGAGAAGGAAGCTGTGGCTGTTTCTGTACGACCAAATGCTGTCTATGACAGAGGAGTTCCATGTCAAAATGGGATGGAGATTGAGAAGAAACAACAAGAGGTGAAAACCCTCTCAAGTGGTAATAATGAACACCACAAGGTAGCGGATGCTCCACTAGACATGTTGCTAATTGCCATTGAAGAAATGCAGAATCTACCATCGTGCAATAAAAGCATATTGCGTCAATCCAGCTCCATTCAACTTGCAATGTCAAGAGAGGAGAGTTCAGCTATCGCTGGTTTACATGAGGCAAGAGCCGAAAGCCTTATGGAACCTCTTGACGAACATGAATACGATGAGCGGCATATGGATGATGACGATGAGACTCTAACAAAAGAGTTCAATGACGGTGATTTTGCTGTGGAAAGTAATGAAGACGACAATGCAAGGGGTTCACAATTCATGAGAAATCCCAAAAGACGCAGGGAGTCCAAGAAAAGTTTGGATAGTGATGAACCACCAAAGAAAAGGAAACGGAACATTATGAATGAGCGTCAGATCAAGACTATTGAAGAGGCTTTGAAAGGAGAGCCTGAAATGCAGCGGTATCCCAAGCTAATACAGCAGTGGACAAATACTCTCAATCAGATGGGTCCAGAAATCGCTGTACACCAGCTTAAAAATTGGCTTAACAACAGAAAAGCGAGGCTAGCGCGGATAGCTAGAGAAGAACGAGCTGCAGAGGGCGATATGCGAGGTGAGAATTCTCGCCACACGAGGTCCGCTCTGGGAATTATGGGGGAGCAGTCGCAGGATATGACTGTCACCCCTCCTGGGTCTCCCGAAGGTACCAGAAGTGTGGACGGGAGTCAAAGAGATTCCGGCACCAAACAGCGAACAAGCAAGTCCTCCCGATCAGTGAGCAAGAGGGACTCTCAAGACAGAGGGACAACTCCAGAACTTTCAAATTTGGAGCCTGGTCACAAGCATCTAGGTGCTGACACGCTGGATTCACGAGGCGGTTCATCCTCAAACAGAGGAGGAAAGTGTAATTTAGGTCATGGTCAGTATGTTTCTCTTAGGGATAATGAAGACAAAGAAGTTGCATTAGGATACATTGTGCAATTGGACGGGGTCTGGCAGAGTAGAGATCTTGAAGAACAGAGTCTTTGTGTGATACAAGTGGTAGAATTCAAAGTGGATAAAACATCCAAATTGCCCTACCCTTCCAACTTGACAGGGTCATCATTTGAAGAAGCTGAAACACTACTTGGGAAAATCAGGGTTGCTTGGCACAAAGACAAGATTGAAATTGTCCAGGACGGTATCAGGAAGTAG
Protein:  
MMGSMQLDVIMAVEELHGLSARDMSKLLKDLHNFTFRFNTSRGALMRVDVQRLATSLPLHLLASVASTGAEPRFRYLLRGVRLLHSLSDLALRYPKLEQLLLHEVKLREQVLDLVIYMLIVLASVEQEGRAGSFVALLHAALVACCLYLLTAFVTKEWHDVVPVLLAHPKVDVFMDAAFDVVRRDIGLLQSKLRTLHIKLASKKATLPAAERVGEATAQQCEASLQVLQSLCQTQAFRDQLLHHKELCTRGGVLRLTLAVLKLNMPSSLQQSKSLVAFISRLKSKALSMLLKFCETESVCFLDEVAAEPQSMQLAELVACEVLALVRSALLEEPRRIEDAETEQNPMGSLHINAMRLADLFSDDSNFRNLVMDGIAPDLGTVLAAPPGVFTSRWCEGDIAVNLAKGEMDAILVYDPFQAAGAAMIASVKGATAPVSGSPGDSLQPLEEMETGCVLSIESTSPDLYARERSALLVKLFANLTCFNPEVCSVDEKDRFLHVFVECLASGPLMKSSSSHFLTPEQTAVRICENLYVLLDHVVSSSHTVNDEDLSLVSQFSDALHHAICPSPPSDESTCLAAPVVSYLRDAHEKKLLMIHTRRQELPRWQKIYQSSMELQEKEAVAVSVRPNAVYDRGVPCQNGMEIEKKQQEVKTLSSGNNEHHKVADAPLDMLLIAIEEMQNLPSCNKSILRQSSSIQLAMSREESSAIAGLHEARAESLMEPLDEHEYDERHMDDDDETLTKEFNDGDFAVESNEDDNARGSQFMRNPKRRRESKKSLDSDEPPKKRKRNIMNERQIKTIEEALKGEPEMQRYPKLIQQWTNTLNQMGPEIAVHQLKNWLNNRKARLARIAREERAAEGDMRGENSRHTRSALGIMGEQSQDMTVTPPGSPEGTRSVDGSQRDSGTKQRTSKSSRSVSKRDSQDRGTTPELSNLEPGHKHLGADTLDSRGGSSSNRGGKCNLGHGQYVSLRDNEDKEVALGYIVQLDGVWQSRDLEEQSLCVIQVVEFKVDKTSKLPYPSNLTGSSFEEAETLLGKIRVAWHKDKIEIVQDGIRK